Generating Multilingual Documents from a Knowledge Base: 

The TECHDOC Project 
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Abstract 

TECHDOC is an implemented system 
demonstrating the feasibility of generating 
multilingual technical documents on the ba- 
sis of a language-independent knowledge 
base. Its application domain is user and 
maintenance instructions, which are pro- 
duced from underlying plan structures rep- 
resenting the activities, the participating ob- 
jects with their properties, relations, and so 
on. This paper gives a brief outline of the 
system architecture and discusses some re- 
cent developments in the project: the addi- 
tion of actual event simulation in the KB, 
steps towards a document authoring tool, 
and a multimodal user interface. 



1 Overview 

1.1 Project idea 

The availability of technical documents in 
multiple languages is a problem of increas- 
ing significance. Not only do consumers 
demand adequate documentation in their 
mother tongue; there are also legal require- 
ments, e.g., with respect to the upcoming 
European common market: the product reli- 
ability act forces merchants to offer complete 
technical documentation in the consumer's 
native language. The need to provide such 
a massive amount of multilingual material is 
likely to exceed both the capacities of human 
translators as well as those of machine trans- 
lation technology currently available. Our 
work in the TECHDOC project is motivated 
by the feeling that this situation calls for 
investigating a potential alternative: to ex- 
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ploit natural language generation technology 
in order to help overcome the documentation 
problem. 

TECHDOC operates in the domain of 
technical manuals, which was selected for 
two principal reasons. On the one hand, 
they represent "real-world" texts that are 
actually useful: the domain is practical in- 
stead of a "toy world". On the other hand, 
the language that is used in such manuals 
tends to be relatively simple; one mostly 
finds straightforward instructions that have 
been written with the intention to produce 
text that can be readily understood by a per- 
son who is executing some maintenance ac- 
tivity. Moreover, as our initial analyses in 
the first phase of TECHDOC had shown, the 
structure of manual sections is largely uni- 
form and amenable to formalization. 

1.2 Outline of the generation pro- 
cess 

TECHDOC produces maintenance instruc- 
tions in English, German and French. The 
system is based on a KB encoding techni- 
cal domain knowledge as well as schematic 
text structure in LOOM, a KL-ONE di- 
alect [?]. The macro structure of a man- 
ual section is captured by schemas say- 
ing that (if appropriate) one first talks 
about the location of the object to be re- 
paired/maintained, then about possible re- 
placement parts/substances; next, the activ- 
ities are described, which fall into the three 
general categories of checking some attribute 
(e.g., a fluid level), adding a substance and 
replacing a part/substance. These actions 
are represented as plans in the traditional AI 



sense, i.e. with pre- and postconditions, and 
with recursive structure (steps can be elab- 
orated through complete refinement plans). 

These representations are mapped onto a 
language-independent document representa- 
tion that also captures its micro structure by 
means of RST relations [?] with a number 
of specific annotations (e.g., a proposition is 
to be expressed as an instruction, giving rise 
to imperative mood). This document rep- 
resentation is successively transformed into 
a sequence of sentence plans (together with 
formatting instructions in a selectable tar- 
get format; SGML, DTpjX, Zmacs and — for 
screen output -- slightly formatted ASCII 
are currently supported), which are handed 
over to sentence generators. For English, we 
use 'Penman' and its sentence planning lan- 
guage (SPL) as input terms. To produce 
German and French text, we have imple- 
mented a German version of Penman's gram- 
mar (NIGEL), which is enhanced by a mor- 
phology module, and a fragment of a French 
grammar in the same way. 

For a more detained description of the sys- 
tem architecture see [?1. 



bility of general technical knowledge has 
been a concern from the beginning. For 
instance, knowledge about various types of 
tanks (with or without imprinted scales, dip- 
sticks, drain bolts) is encoded on an abstract 
level in the inheritance network (the 'mid- 
dle model'), and the particular tanks found 
in the engine domain are attached at the 
lower end. Similarly, we have an abstract 
model of connections (plugs, bolts, etc.), 
their properties, and the actions pertaining 
to them (plug-in connections can be merely 
connected or disconnected, screw connec- 
tions can be tightly or loosely connected, or 
disconnected). Objects with the function- 
ality of connections (e.g., spark plugs) ap- 
pear at the bottom of the hierarchy. Thus, 
when the system is transferred to a different 
technical domain — as experienced recently 
when we moved to aircraft manuals — , large 
parts of the abstract representation levels are 
re-usable. 
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2 The Knowledge Base 

The Knowledge Base is encoded in LOOM. 
In addition to the standard KL-ONE func- 
tionality (structured inheritance, separa- 
tion of terminological and assertional knowl- 
edge), LOOM supports object-oriented and 
also rule-based programming. 

In addition to the 'Upper Model' of the 
Penman generator (a basic ontology that 
reflects semantic distinctions made by lan- 
guage, [?]) more than 1000 concepts and in- 
stances constitute the TECHDOC KB. They 
encode the technical knowledge as well as the 
plan structures that serve as input to the 
generation process. The domains currently 
modeled are end consumer activities in car 
maintenance and some technical procedures 
from an aircraft maintenance manual. 

One of the central aims in the design phi- 
losophy of the TECHDOC knowledge base is 
the separation of domain-independent tech- 
nical knowledge and specific concepts per- 
taining to the particular domain: the porta- 



The first task undertaken in TECHDOC was 
a thorough analysis of a corpus of pages from 
multilingual manuals in terms of content as 
well as structure of the sections. A text rep- 
resentation level was sought that captured 
the commonalities of the correponding sec- 
tions of the German, English and French 
texts, i.e. that was not tailored towards one 
of the specific languages (for a discussion of 
representation levels in multilingual genera- 
tion, see [?]). Rhetorical Structure Theory 
(RST) turned out to be a useful formalism: 
for almost every section we investigated, the 
RST trees for the different language versions 
were identical. 

Our work with RST gave rise to a number 
of new discourse relations that we found use- 
ful in analyzing our texts. Also, we discov- 
ered several general problems with the the- 
ory, regarding the status of minimal units 
for the analysis and the requirement that the 
text representation be a tree structure all the 
time (instead of a general graph) . These and 
other experiences with RST are reported in 



4 Recent Developments 

4.1 Event simulation in the knowl- 
edge base 

We developed a detailled representation of 
knowledge about actions. Together with 
an action concept, preconditions and post- 
conditions can be defined in a declarative 
way. The preconditions can be checked 
against the current state of the knowledge 
base (via LOOM's ASK queries). If the pre- 
conditions hold, the action can be performed 
and the postconditions are communicated to 
the knowledge base (with the TELL facil- 
ity of LOOM). This typically leads to re- 
classification of certain technical objects in- 
volved. With the help of LOOM's produc- 
tion rule mechanism, additional actions ei- 
ther in the knowledge base or on an output 
medium (e.g., for visualization) can be trig- 
gered. In this mode, instruction generation 
is a by-product of simulating the actions that 
the instructions pertain to. 

Being able to take the current state of 
a technical device into account, as in this 
simulation mode, is a prerequisite for up- 
coming interactive applications of instruc- 
tion generation: devices equipped with ade- 
quate sensory instruments produce raw data 
that can be fed directly into the knowledge 
base. Thereby, the specific situation of the 
device, e.g., the car, drives the instruction 
generation process, so that only the truly rel- 
evant information is given to the user. 

4.2 Towards a document authoring 
tool 

A first version of an authoring tool has been 
designed and implemented and tested with a 
number of users. The authoring tool allows 
to interactively build up knowledge base in- 
stances of maintenance plans, including the 
actions and objects involved, and to convert 
them immediately into documents in the se- 
lected languages. At any time, the tool takes 
the current state of the knowledge base into 
account: all menus offering selections dy- 



namically construct their selection lists, so 
that only options of applicable types are of- 
fered. 

4.3 From text generation to a mul- 
timodal information system 

The generated texts are now displayed with 
words, groups and phrases and whole sen- 
tences being mouse-sensitive and - - when 
selected - - offering menus with applicable 
queries to be directed to the underlying 
knowledge base instances. This allows for 
a number of tasks to be performed on the 
generated surface texts, for example: 

• pronouns can be asked about their an- 
tecedent referent, 

• linguistic items in the output for one 
language can be asked about their cor- 
responding items in the other languages 
output, 

• objects can be asked about their loca- 
tion, answered by a suitable graphic il- 
lustration, 

• actions can be asked for more detailled 
instructions on how to perform them, 
answered by a short video sequence. 

In essence, these facilities have paved the 
way to move from static, inactive strings as 
output to an active and dynamic interface 
for the associated knowledge sources and 
their various presentation modalities. The 
key is that all information types (lexemes 
in various languages, images and object's lo- 
cation therein, and video sequences) are as- 
sociated with the underlying KB instances, 
which are in turn linked to their referents 
in the mouse-sensitive output text. Figure 
1 shows a sample screen, where the user has 
just asked for additional "location" informa- 
tion about the dipstick, by clicking on the 
word in one of the text output windows. 

5 Implementation Note 

The current version of TECHDOC is run- 
ning on Sun Sparc stations with LUCID 
CommonLISP 1.4 and LOOM 1.41 (a port 
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German; 



Kontrollieren 

Das Auto auf ebenenn Boden abstellen, und den 
Motor abstellen. Dann das Motoroel 
ueberpruefen t Den |Tauchmessstab | entfernen, 
inn abwischen, ihn wiedereinf uehren, ihn 
wieder entfernen, und den Stand ablesen + Er 
sollte zwischen der oberen Pegelrnarke und 
der unteren Pegelrnarke sein t 



English; 



Checking 

Park the car on level grounds and switch off 
the engine* Then, check the engine oil. 
Remove the |dipstick.J wipe off it, reinsert 
it, remove it again, and read the ievel + It 
should be between the upper nark and the 
lower mark. 



French; 



La verification 

Garer la voiture sur la surface de niveau, 
puis eteindre le rnoteur. Puis, verifier 1' 
huile moteur, Retirer la |jaugej essuyer la, 
reintroduire la, retirer l r a nouveau, puis 
voir le niveau,. II devoir etre entre le 
repere superieur et le repere inferieur. 






. 



Figure 1: Trilingual output and interactive graphic support 



to LOOM 2.1 is underway), and a PEN- 
MAN version from 1991. The user interface 
is based on the CommonLISP Motif interface 
package CLM and the application building 
tool GINA [?]. 
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